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, Consider a random sample from a bivariate distribution function 

^0 ' F m the max-domain of attraction of an extreme-value distribution 

function G. This G is characterized by two extreme-value indices 
and a spectral measure, the latter determining the tail dependence 
structure of F. A major issue in multivariate extreme- value theory 
E"^ , is the estimation of the spectral measure <£> p with respect to the 

^0 ' L p norm. For every p £ [l,oo], a nonparametric maximum empiri- 

pH . cal likelihood estimator is proposed for <J? P . The main novelty is that 

these estimators are guaranteed to satisfy the moment constraints 
by which spectral measures are characterized. Asymptotic normal- 
ity of the estimators is proved under conditions that allow for tail 
independence. Moreover, the conditions are easily verifiable as we 
demonstrate through a number of theoretical examples. A simulation 
■ study shows a substantially improved performance of the new esti- 

' mators. Two case studies illustrate how to implement the methods 

OO ■ m practice. 
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1. Introduction. Let F be a continuous bivariate distribution function 
in the max-domain of attraction of an extreme-value distribution function 
G. Up to location and scale, the marginals of G are determined by the 
extreme- value indices of the marginals of F. The dependence structure of G 
can be described in various equivalent ways; in this paper we focus on the 
spectral measure $ introduced in de Haan and Resnick (1977). The spectral 
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or angular measure is a finite Borel measure on a compact interval, here 
taken to be [0,7r/2]. 

Given a random sample from F, statistical inference on the upper tail of 
F falls apart into two pieces: estimation of the upper tails of its marginal 
distributions, which is well understood, and estimation of <3?, which we will 
consider in this paper. The actual representation of the spectral measure 
depends on the norm used on M 2 ; here we will consider the L p norm for 
every p E [l,oo], with & p denoting the corresponding spectral measure. The 
most common choices in the literature are p = 1, 2 and oo. 

It is the aim of this paper to derive a nonparametric estimator of the 
spectral measure, superior to its predecessors, and to establish its asymp- 
totic normality. In Einmahl, de Haan and Piterbarg (2001), a nonparamet- 
ric estimator was proposed for §oo- This estimator, which we will refer 
to as the empirical spectral measure, was shown to be asymptotically nor- 
mal under the assumption that has a density, excluding thereby the 
case of tail independence. Moreover the empirical spectral measure is itself 
not a proper spectral measure because it violates the moment constraints 
characterizing the class of spectral measures. A related estimator in a more 
restrictive framework was proposed in Einmahl, de Haan and Sinha (1997). 

The contributions of our paper are threefold: first, to propose a nonpara- 
metric estimator for the spectral measure which itself satisfies the moment 
constraints; second, to allow for arbitrary L p norms, p£ [l,oo]; third, to 
prove asymptotic normality under flexible and easily verifiable conditions 
that allow for spectral measures with atoms at or vr/2, including thereby 
the case of tail independence. We do this in two steps: first we define for 
every p£ [1, 00] the empirical spectral measure $ p and extend the results in 
Einmahl, de Haan and Piterbarg (2001) under the weaker conditions men- 
tioned above; second, we use a nonparametric maximum empirical likelihood 
approach to enforce the moment constraints, thereby obtaining an estima- 
tor $ p that is itself a genuine spectral measure. A simulation study shows 
that the new estimator & p is substantially more efficient than the empirical 
spectral measure <3? p . 

As the new estimator takes values in the class of spectral measures, it 
can be easily transformed into estimators for the aforementioned other ob- 
jects that can be used to describe the dependence structure of G. This 
holds in particular for the Pickands (1981) dependence function and the 
stable tail dependence function [Huang (1992), Drees and Huang (1998) 
and Einmahl, de Haan and Li (2006)]. For a general background on spec- 
tral measures and these dependence functions as well as results for the 
corresponding estimators, see for instance the monographs Coles (2001), 
Beirlant et al. (2004) and de Haan and Ferreira (2006). 

An alternative to the nonparametric approach in this paper is the para- 
metric one [Coles and Tawn (1991) and Joe, Smith and Weissman (1992)]. 
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Parametric models for the spectral measure are usually denned for p = 1 be- 
cause this choice tends to lead to simpler formulae. Many parametric models, 
such as the asymmetric (negative) logistic and the asymmetric mixed mod- 
els, allow the spectral measure to have atoms at and ir/2. 

The paper is organized as follows. In Section 2 we review the general 
probabilistic theory for spectral measures. The asymptotic normality re- 
sults for 3>p and $ p are presented in Sections 3 and 4, respectively. In Sec- 
tion 5 some theoretical examples are discussed and used in a simulation 
study; moreover, the methods are applied to a sample of insurance indem- 
nity claims taken from Frees and Valdez (1998) and to body measurement 
data of the National Health and Nutrition Examination Survey 2005-2006 
[National Center for Health Statistics (2007)]. Sections 6 and 7 contain the 
proofs of the results in Sections 3 and 4, respectively. 

2. Spectral measures. Let (X\,X 2 ) be a bivariate random vector with 
continuous distribution function F and marginal distribution functions F\ 
and F 2 . Put 

<"> Zi= T^m- 3=1,2 ' 

Define E = [0, oo] 2 \ {(0, 0)}. Assume that 

(2.2) sPv[ S - 1 (Z 1 ,Z 2 )e-} A /,(■), s^oo, 

where "A" stands for vague convergence of measures (in E): for every contin- 
uous /:E— > M with compact support, lim^oo sE[f(s~ 1 (Zi, Z2))] = J E fdfj,. 
The exponent measure f/, enjoys two crucial properties: homogeneity, 

(2.3) /i(c-) = c~V(-)« 0<c<oo, 
and standardized marginals, 

(2.4) fj,([z, 00] x [0, 00]) = /i([0, 00] x [z, 00]) = 1/z, < z < 00. 

Note that [i is concentrated on [0, oo) 2 \{(0,0)}, that is, /i([0, oo] 2 \ [0, oo) 2 ) = 
0. 

Let || • || be an arbitrary norm on M 2 ; for convenience, assume that || (1, 0) || = 
1 = ||(0, 1)||. Consider the following polar coordinates, (r, 8), of (21,22) £ 
[0,oo) 2 \{(0,0)}: 

r= ||(2i,2 2 )|| G (0,oo), 

(2.5) 

6 = aicta^zx / z 2 ) G [0,7r/2j. 

As we will see later, the choice of radial coordinate r through the norm 
has important implications; the choice of the angular coordinate is unim- 
portant, that is, we could just as well have used 2l/(2l + z 2 ) G [0,1] or 

Vll(*l,*2)||. 
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Given the exponent measure \i and using polar coordinates (r, 9) as in 

(2.5) , define a Borel measure $ on [0,7r/2] by 

(2.6) $(-) = ii({(zi,Z2) G [0,oo) 2 :r > 1,6 € ■}). 

The spectral measure $ admits the following interpretation in terms of 
(Zi,Z 2 ) in (2.1): 

(2.7) sPr[||(Zi,Z 2 )|| >s,arctan(Zi/Z 2 )G-] s^oo. 
By homogeneity of /x, see (2.3), for every /x-integrable /:E— >R, 

(2.8) fdfi= / /( Zl (r,0),^ 2 (r,0))r- 2 dr$(^), 

J[o,tt/2] Jo 

where Z\(r, 6) = r sin0/||(sin#, cos#)|| and 2 2 (r, 0) = r cos #/||(sin#, cos#)|| form 
the inverse of the polar transformation (2.5). By (2.8), in the polar coordi- 
nate system (r, 9), the exponent measure \i is a product measure r~ 2 dr <&(d9). 
In particular, the exponent measure \i is completely determined by its spec- 
tral measure <£>. The standardization constraints (2.4) on \i translate into 
moment constraints on <£: 

t sin# A\fj/}\ f cos 6 jf./in\ 

(2-9) J [0 ^/2] || (sin9, cos 9) || ( } = 1 = 7[o,V2] || (sin^, cos 0)|| ( } ' 

Note that Xi and X 2 are tail independent, that is, sPr[Zi > s,Z 2 > s] — > 
as s — > oo, if and only if fi is concentrated on the coordinate axes, or, equiv- 
alently, $ is concentrated on {0,7r/2}; in that case, $({0}) = 1 = ®({n/2}). 
On the other hand, the variables X\ and X 2 are completely tail dependent, 
that is, sPr[Zi > s, Z 2 > s] — > 1 as s — > oo, if and only if is concentrated 
on the main diagonal, or, equivalently, $ is concentrated on {7r/4}; in that 
case, $({7r/4}) = ||(1, 1)||. In general, the total mass $([0,7r/2]) of a spectral 
measure is finite but even for a fixed norm it can vary for different expo- 
nent measures fi, with one exception: in case of the L\ norm, by addition 
of the two constraints in (2.9), <J>([0,7r/2]) = 2 for every exponent measure 
fi. The spectral measure was introduced in de Haan and Resnick (1977); 
for more details on the results in this section see Beirlant et al. (2004) and 
de Haan and Ferreira (2006). 

Dividing the spectral measure $ by its total mass yields a probability 
measure Q on [0, 7r/2] 

(2.10) Q(.)=*(0/<&([O,7r/2]), 
which we coin the spectral probability measure. By (2.7) 

(2.11) Pr[arctan(Zi/Z 2 ) G ■ | \\(Z 1 ,Z 2 )\\>s]-^Q(-), s oo. 
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In words, Q is the limit distribution of the angle arctan(Zi/Z2) when the 
radius \\{Z\, Z 2 )\\ is large. The moment constraints (2.9) on $ are equivalent 
to the following moment constraint on Q: 

J[0,ir/2] || {smO, COS 0)\\ 7[0.7r/2] ||(sm6>,cos0)|| 

(2.12) 

= :m(Q). 

Conversely, we can reconstruct from Q by 
(2.13) <&(■) = Q(-)MQ)- 

The spectral probability measure Q allows nonparametric maximum likeli- 
hood estimation; see Section 4. The estimator of then follows through 

(2.13) . 

In Einmahl, de Haan and Piterbarg (2001), tail dependence is described 
via the measure A arising as the vague limit in [0, oo] 2 \ {(oo, oo)} of 

(2.14) sPT[(s{l-F 1 (X 1 )} ) s{l-F 2 (X 2 )})e-] AA(-), s ^ oo. 

Let P be the probability measure on [0, l] 2 induced by the random vector 
(U 1 ,U2):=(l-F l (X 1 ),l-F 2 (X 2 )). Then (2.14) can be written as 

(2.15) r^fi^Af.), no. 

Comparing (2.14) with (2.2), we find that fj, and A are connected through a 
simple change-of-variables formula: for Borel sets B C [0,oo] 2 \ {(oo,oo)}, 

(2.16) A(B) = n({(z 1 ,z 2 )eM:(l/z 1 ,l/z 2 )eB}). 
From (2.14) or also from (2.3) and (2.4), it follows that 

A(c-)=cA(-), 0<c<oo, 

(2.17) 

A([0,«] x [0,oo]) = A([0,oo] x [0, «]) = «, < u < oo. 

The equality above with u = shows that A does not put any mass on the 
coordinate axes. Combining (2.6) and (2.16), we find 

(2.18) $(•) = A({(ui,u 2 ) G (0,oo] 2 : IKu^ 1 ,^ 1 )!! > 1, arctan(u 2 /^i) € ■}). 
In particular, for u € [0, oo), 

A({oo}x (0,«]) = «$({0}), 

A((0,it] x {oo}) = u$({vr/2}). 
The spectral measure corresponding to the L p norm, 

^)b={Kt^- i ^ i ' 6[l • oo, • 

l \zi\ V \z 2 \, ifp = oo, 
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will be denoted by <J? p . Write 

oo 

(2.19) y p (x) 



1 + 



1 



xP-1 



i/p 



1, 



if xe [o,i), 

if x £ [1, oo] and p £ [1, oo), 
if x £ [1, oo] and p = oo. 



Note that for x > 1, y p (x) is the (smallest) value of y E [l,oo] that solves 
the equation || (a; -1 , j/ -1 ) || p = 1. Now by (2.18), 



$ P ([O,0])=A(C P , 6 



0e[O,vr/2], 



(2.20) 



where 

f ([0,oo] x{0})U({oo} x [0,1]), 
if = 0, 

{(x, y):0<x<oo,0<?/< (rrtan^) A y p (x)}, 

if < 6» < tt/2, 
{(z, y):0<x<oo,0<?/< y p (z)}, 
if = 7r/2. 

Further, note that xtan8 < y p (x) if and only if x < x p (6), where for 9 € 
[0,vr/2], 

(l + cotP^) 1 ^, ifpG[l,oo), 
lVcot0, ifp = oo. 

The relation between y p (x), x p (6) and C P: o is depicted in Figure 1. 



(2.211 



x p (6) = ||(1, cot d)\\ p : 



Remark 2.1. In this paper we shall make no assumptions on the margin- 
al distribution functions -Fi and F<i whatsoever except for continuity. How- 
ever, if in addition to (2.2) the marginal distribution functions F\ and F2 
are in the max-domains of attraction of extreme-value distribution func- 
tions G\ and G2, then F is actually in the max-domain of attraction of a 
bivariate extreme-value distribution function G with marginals G\ and G2 
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and spectral measure <3?. More precisely, if (2.2) holds and if there exist real 
sequences a n > 0, b n , c n > and d n such that F™(a n x + b n ) G\{x) and 
F^ic^y + d n ) -i- G2(y) for all i,j/GR and as n — > oo, then 

F n (a n x + 6 n , c n y + d n ) -* G(x, y) = exp[-Z{- log G x (x), - log G 2 (y)}} 

for all x,y e M. The function Z on the right-hand side is called the stable tail 
dependence function [Huang (1992) and Drees and Huang (1998)] and can 
be expressed in terms of either A, //or $ through 

Z(xi,x 2 ) = A({(ui,m 2 ) G [0,oo] 2 :ui < xi or u 2 < x 2 }) 
= fi({(zi,Z2) G [0,oo] 2 : 0i > xT 4 or z 2 > £ 2 1 }) 
max(ii sin#, X9 cos#) , 
[o,tt/2] ||(sin0,cos0)|| 

where we used (2.8) for the final step. Standardizing the marginals of G to 
the unit-Frechet distribution yields the extreme-value distribution function 

(2.22) G*(xi,x 2 ) = exp{-Z(l/xi, l/x 2 )}, xi,x 2 >0. 

In the notation of Coles and Tawn (1991), we have Z(l/xi, l/x 2 ) = V(x\,x 2 ). 
The Pickands dependence function A: [0, 1] — > [1/2, 1] is defined by A(v) = 
1(1 - v,v) for v £ [0, 1] [Pickands (1981)]; we have 

(X! , x 2 ) = exp{ - (- + - W^M ) . 

I \Xl X 2 J \Xl+X 2 J) 

The spectral measure H in Coles and Tawn (1991) and Joe, Smith and 
Weissman (1992) is connected to our $i through the relation 

(2.23) H([0,w]) = <S>i(&rctan{w/(l-w)}), we [0,1]. 

The Pickands dependence function A can be recovered from H by the for- 
mulas 



A(v) = I max-fWl — v), (1 — w)v\H(dw) 

J{0,X] 

(2.24) 

= l-v+ [ H([0,w])dw. 
Jo 



Example 2.2 (Asymmetric logistic model). The asymmetric logistic 
model with parameters r > 1 , < ip\ , ip 2 < 1 is defined by its stable tail 
dependence function 

Z(xi, x 2 ) = (1 - ^i)xi + (1 - ^ 2 )x 2 + {(^ixi) r + {ip 2 x 2 ) r } 1 l' r 

for (xi,xi) € [0,oo) 2 [Tawn (1988)]. The special case ^± = ip 2 = l yields the 
logistic model, originally due to Gumbel (1960). Tail independence arises for 
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r = 1 or "01^2 = 0, whereas complete tail dependence arises for tpi = ip2 = 1 
and r — ► oo. It can be shown that for 1 < r < oo, the spectral measure <3? p 
has point masses ^({0}) = 1 — ^2 and <& p ({tt /2}) = 1 — tpi, while on the 
interior (0, 7r/2) the measure $ p is absolutely continuous with density 

^(0) = (r - l)(^ 2 ) r (sin p # + cos p 0) 1/p (sin0 + cos0) r " 2 
do 

x (sin cos 9) r ~ 2 { (V'i cos #) r + (V> 2 sin 0) r } 1/r ~ 2 . 

3. Empirical spectral measures. Let (Xji,JQ 2 ), i = l,...,n, be inde- 
pendent bivariate random vectors from a common distribution function 
F satisfying (2.2). Our aim is to estimate the spectral measure & p cor- 
responding to the L p norm for arbitrary p€ [l,oo]. For convenience, write 
$ P (0) = $ P ([O,0]) for0€[O,ir/2]. 

Consider the left-continuous marginal empirical distribution functions 

1 n 

(3.1) Fj{xj) = - £ l{X i3 < x,), Xj €R,j = 1, 2. 
Define 

(3.2) U ij = l-F j (X ij ) = n+1 ~ Rii , i = l,...,n; j = 1,2; 

here -Rij = Ya=i ^-{p^-lj — Xij) is the rank of Xij among Xy, . . . , X n j. Let P n 
be the empirical measure of (Un, Ui2), i = 1, ■ ■ ■ ,n, that is, 

1 n , 

Pn(-) = -E 1 i(^i > C/i2)e-}. 
n r - ; 

4=1 

Observe that the transformed data (Un,Ui2), i = l,...,n, are no longer 
independent. This dependence will contribute to the limiting distribution of 
the estimators to be considered. 

Let k = k n £ (0, n] be an intermediate sequence, that is, k — > oo and k/n — ► 
as n — > oo. We find our estimator $ p by using (2.15) and (2.18) with t = k/n 
and P replaced by P n . In terms of distribution functions, this becomes 



1 n 

l{(n + 1 - ^i)" p + (n + 1 - i^ 2 )" p > A:" p , 

i=l 



n + 1 - P i2 < (n + 1 - Rn) tan6»} 
for 9 G [0,vr/2] and with C Pi e as in (2.20). 
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In Einmahl, de Haan and Piterbarg (2001), the limiting behavior of & p 
has been derived in case p = oo. We now present a generalization to all L p 
norms for p G [l,oo]. More precisely, we will study the asymptotic behavior 
of the process 

sfk{%{9) - * p (0)}, 0g[O,tt/2]. 
We will assume that 
(3.3) A = A c + A d , 

where A c is absolutely continuous with a density A, which is continuous on 
[0,oo) 2 \{(0,0)}, and with A d such that A d ([0, oo) 2 ) = 0, A d ({oo} x [0,u]) = 
u&p({0}) and A d ([0,u] x {oo}) = u<f> p ({ir/2}) for u G [0,oo). In contrast to 
Einmahl, de Haan and Piterbarg (2001), 3> p is allowed to have atoms at 
and 7r/2; in particular, tail independence is allowed. Also, the restriction 
of & p to (0,7r/2) is absolutely continuous with a continuous density. This 
excludes complete tail dependence, that is, <& p being degenerate at 7r/4, in 
which case A is concentrated on the diagonal. The homogeneity of A in 
(2.17) implies that X(cu\,cu2) =c A(iii,U2) for all c> and (111,142) G 
[0,oo) 2 \{(0,0)}. 

Let P n be the empirical measure of (Un, tfo) = (1 — Fi(Xn), 1 — ^2(^2)), 
i = l,...,n, and let T jn (u) =n~ 1 J2?=i <«),«£ [0,1] and j G {1,2}, be 
the corresponding marginal empirical distribution functions; for u G (l,oo], 
we set Tj n (u) = u. Furthermore, for 9 G [0,7r/2], define the set 

C p ,e = j{(ui,u 2 ) G [0,oo] 2 \ {(00,00)} : (T ln (u 1 ),T 2n (u 2 )) G -C p>t 

From the identity T jn (u) = 1 - ^(^(l - u)) for u G (0, 1) it follows that 

This representation yields the following crucial decomposition: for 6 G [0, tt/2] , 



(3.4) _ lk Vn 7 J 

+ v / fc{A((7 p , e ) - A(C p , e )} 

= :V n , p (9) + r n>p (0)+Y n)P {e). 

The first term, V^ )P , features a local empirical process evaluated in a random 
set C Pj q. The second term, r njP , is a bias term, which will vanish in the 
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limit under our assumptions. The third term, Y UtP , is due to the fact that 
the marginal distributions are unknown and captures the effect of the rank 
transformation in (3.1) and (3.2). 

Next we will define the processes that will arise as the weak limits of the 
processes V n , p and Y n<p in (3.4). Define W\ to be a Wiener process indexed 
by the Borel sets of [0, oo] 2 \ {(oo, oo)} and with "time" A, that is, a centered 
Gaussian process with covariance function E\W\(C)W\{C')\ = A(C n C). 
We can write, in the obvious notation, W\ = W\ c + W\ d , where the two 
processes on the right are independent. Note that 

(^A(G») ee[(W2] = m*pV)))eeio,«/2) 

with W a standard Wiener process on [0, oo). Define W\(x) = Wa([0,x] x 
[0,oo]) and W 2 {y) = W A ([0,oo] x [0,y]) for x,y G [0,oo). Note that Wi and 
W 2 are standard Wiener processes as well. For p G [1, 00), define the process 
Z CyV on [0,7r/2] by 

r x p (8) 

Z cp (9) = 1(0 <tt/2) / A(x,xtan0){Wi(x)tan0- W 2 (xtan9)} dx 
Jo 

\{x,y p {x)){Wi{x)y' p {x) - W 2 (y P (x))} dx, 



+ < 



x v {9) 

if p < 00, 

/•lVtanfi poo 

-Wiil) I X(l,y)dy-W 2 (l) I X(x,l)dx, 

lvcot e 



if p = 00 

with y' the derivative of y p . Define by 

Z d (9) = -%{{Q})W 2 {\), 9e[0,n/2], 
and write Z p = Z CtP + Z&. It is our aim to show that 

{V n , P ,r n , p ,Y ntP ) -> (Wa(C p> .),Q,Z p ), n^oo. 

This convergence and the decomposition in (3.4) then will yield the asymp- 
totic behavior of y/k($ p — $). 

Assume that P is absolutely continuous with density p. Then the measure 
t _1 P(£-), for t > 0, is absolutely continuous as well with density tp(tu\,tu 2 ) . 
For 1 < T < 00 and t > 0, define 

(3.5) T>T(t) := / / \tp(tui,tu 2 ) - \(u\,u 2 )\ du\ du 2 , 

J Jc T 

where Ct = {(u±,u 2 ) :0 < u\ A u 2 < 1, u\ V U2 < T}. 
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Theorem 3.1. Assume the framework of Section 2 and suppose A is as 
in (3.3). Then, ifDi/t(t) — ► as t j and if the intermediate sequence k is 
such that 

(3.6) \ZkV n / k (k/n) — > 0, n — ► oo, 
then in D[0,n/2] andasn^oo, 

(3.7) >/fc(*p-* p )-iWA(C7p 1 .) + ^p=:a p . 

The condition lim t j 'C , i/t(i) = in Theorem 3.1 implies <3? p ({0, 7r/2}) = 
and thus = 0. Indeed, in case 7^ 0, the convergence in (3.7) cannot 
hold: when, for example, $ p ({0}) > 0, we have, since $p(0) = 0, yk{$ p (0) — 
3>p(0)} — > —00. In contrast, the following result does allow to have atoms 
at or 7r/2. Recall T>x{t) in (3.5) and a p in (3.7). 

Theorem 3.2. Let n £ (0, vr/4). Assume the framework of Section 2 and 
suppose A is as in (3.3). Then, ifV\(t) — > as t [ and i/ the intermediate 
sequence k is such that 

(3.8) v^mf {X> T (A;/n) + l/r}^0, rw 00, 
then in D[rj, ir/2 — rj] and as n — > 00, 

(3.9) Vk($ p - $ p ) 4 ap. 



In case of tail independence, that is, <J> P ({0}) = & p ({ir /2}) = 1 and A = 0, 



we have a p = 0. 



Under a stronger condition on the sequence k, the convergence of the 
process y/k~($> p — & p ) holds on the whole interval [0,7r/2], provided that we 
flatten the process on intervals [0, r) n ] and [n/2 — r) n ,7c/2], with rj n £ (0, 7r/4) 
tending to zero sufficiently slowly. Define the transformation r n : [0, 7r/2] — > 
[0,tt/2] by 



(3.10) r„(0) 




if < 6* < r? n , 
if Vn<0< ir/2 - r) n , 
rj n , if 7? n , < 6> < ir/2, 
if 6 = ir/2. 



Theorem 3.3. Let k be an intermediate sequence and let rj n = (k/n) a 
for some fixed a £ (0, 1). Assume the framework of Section 2 and suppose A 
is as in (3.3). If 

(3.11) Vk inf {V T {k/n) + 1/T} 0, n->oo, 
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then in D[0,n/2] andasn^oo, 

(3.12) Vk($ p — 3> p ) OT n ^ Op. 

Theorems 3.1 and 3.3 will be instrumental when establishing our main 
results in the next section. 

4. Enforcing the moment constraints. Fix p G [l,oo] and let Q p be the 

class of probability measures Q p on [0, 7r/2] such that 



(4.1) / f(8)Q p (d6)=0, 

J[0,n/2] 

where 

< 4 - 2 > 

If Q p is the spectral probability measure of some exponent measure n with 
respect to the L p norm, then Q p G Q p by (2.12). Conversely, if Q p £ Q p , 
then we can define an exponent measure \x through (2.8) and (2.13) which 
has Q p as its spectral probability measure with respect to the L p norm. 
As before, denote distribution functions of measures under consideration by 
Q P (0) = Q p ([O,0]),etc. 

In view of (2.10), we define the empirical spectral probability measure Q p 

by 

where N n = \I n \ and 

@ in = arctan(LTj 2 /L^i), i = l,...,n; 
I n = {i = l,...,n:\\(U~ 1 \U^)\\ p > n /k}. 



Typically, 



is different from zero, in which case Q p does not belong to Q p , that is, Q p 
is itself not a spectral probability measure. 

Therefore, we propose to modify Q p such that the moment constraint 
(4.1) is fulfilled and the new estimator does belong to Q p : define 

Qp(') := E Pinl{<din G ■), 

iein 
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where the weight vector (pi n : i G I n ) solves the following optimization prob- 
lem: 

maximize Tpinj 

i 

constraints pi n > for all i G I n , 
(4.4) " 

Z2Ptn = 1, 

i 

i 

The thus obtained estimator Q p can be viewed as a maximum empirical 
likelihood estimator (MELE) based on the sample {Oi n :i G I n }; see the 
monograph Owen (2001). Actually, the optimization problem in (4.4) can 
be readily solved by the method of Lagrange multipliers [see, e.g., Owen 
(2001), page 22]: let p, n be the solution in (—1, 1) to the nonlinear equation 

<"> ^rrBkrr 

and define 

(4.6) p in = -L - 1 iei 

N n l + Mn/(8m) 

then the vector (pi n : i G I n ) is the solution to (4.4). Observe that the original 
estimator Q p corresponds to jl n = and is the solution to (4.4) without the 
final constraint J2iPinf(<8>in) = 0. 

Since Q p G Q p , we can exploit the transformation formulas in Section 2 
to define estimators of the spectral measure $ p : as in (2.13), 

*p(0 :=Qp(0 /m p (Q p ), 
where for a bounded, measurable function h: [0, 7r/2] — >R, 

mp(h):=- r' 2 h{e)d-. 



lo ||(sin0,cos0)||p' 
cf. (2.12). Further, for 6 G [0,tt/2], define 1(9) = f m f{®)dQ P {'&) and 

$ p (ir/2)a p (6) - a p (vr /2)%{9) 



M0) 



$2 (vr/2 ) 



(4.7) ^)=ft(«) + ^M-^ 

J[0,tt/2] •> a( ^P 
r _ ^p(Q P )7p(^) -ra P (7p)Qp(0) 
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Note that under the assumptions of Theorem 4.1 below, Q p ({tt / '4}) < 1 and 
thus / f 2 dQ p > 0, so that 7 P (0) is well defined. 

The next two theorems, providing asymptotic normality of are the 
main results of this paper. 

Theorem 4.1. Let the assumptions of Theorem 3.1 be fulfilled. Then 
with probability tending to one, (4-5) admits a unique solution jl n and hence 
in this case the vector (pi n :i E I n ) in (4-6) is the unique solution to (4-4)- 
Also, in D[0,tt/2] and as n — > oo, 

(4.8) Vk(Q p - Q p ) 4 7p , 

(4.9) Vk{% - %) 4 S p . 

Since Theorem 4.1 is based on Theorem 3.1, the spectral measure cannot 
have atoms at or ir/2. The following result, based on Theorem 3.3, does 
allow for such atoms. 

Theorem 4.2. Fix r\ G (0,tt/4) and let r\ n = (k/n) a for some < a < 1. 
Assume the framework of Section 2 and suppose A is as in (3.3). If 

(4.10) VkV 2/rin (k/n) + Vkr) n ^0, oo, 

then in D[rj, ir/2 — rj] and as n — > oo, the convergence in (4-8) and (4-9) 
holds. 

Remark 4.3. From (4.7), it is straightforward to express the limit pro- 
cess 8 P in terms of the process a p and thus of W\. However, because of the 
presence of the process Z p , no major simplification occurs. As a consequence, 
we were not able to show that <^ p is asymptotically more efficient than <E> P . 
However, the simulation study in Section 5 does indicate that enforcing the 
moment constraints leads to a sizeable improvement of the estimator's per- 
formance. 

Remark 4.4. Replacing <E>i by $i in (2.23)-(2.24) yields an estimator 
A of the Pickands dependence function A that is itself a genuine Pickands 
dependence function. The weak limit of the process \fk(A — A) in the func- 
tion space C[0, 1] can be easily derived from the one of y / A;( < i > i — <E>i). Non- 
parametric estimation of a Pickands dependence function in the domain- 
of-attraction context was also studied in Caperaa and Fougeres (2000) and 
Abdous and Ghoudi (2005). 
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5. Examples, simulations and real data analyses. 

5.1. Examples. 

Example 5.1 (Cauchy). Consider the bivariate Cauchy distribution on 
(0, oo) 2 with density (2/7r)(l + x 2 + y 2 )~ 3 ^ 2 for x,y>0. It follows that 

A([0, x] x [0, y]) = x + y - (x 2 + y 2 ) 1/2 , < x, y < oo 

and 



$ p (0)= / ||(sintf,cosi?)|Lc2tf 

JO 

for 9 £ [0,7r/2]. It can be shown that = 0(t) as t J. 0. Therefore, 

Theorems 3.1 and 4.1 hold when k = o(n 2 / 3 ) as n — > oo. In Figure 2, the 
function $ p is plotted for p S {l,2,3,oo} together with trajectories of the 
empirical spectral measure and the MELE for a single sample of size n = 
1000 and at k = 30. Observe that both estimators share the same set of 
atoms but with possibly different weights. 

The bivariate Cauchy distribution on the full plane M 2 has density func- 
tion (27r) _1 (l + x 2 + y 2 ) -3 / 2 for x,y £ R and spectral measure 

$ p (0) = ^(l + J° ||(sintf, costf)|| p d-d + 1(0 = tt/2) 

In particular, ^({0}) = ® p ({ir/2}) = 1/2. For every < a < 1 and % = 
(k/n) a , we find T>2/ rin (k/n) = 0((k/n) 2 ~ a ) as n — ► oo. Hence the conclusions 
of Theorems 3.3 and 4.2 hold provided k = o(n 2a ^ 2a+1 ^) as n — > oo. In fact, 
the results of Theorem 4.2 can be shown to hold when k = o(n 2//3 ) as n — > oo. 

Example 5.2 (Mixture). For r G [0,1], consider the bivariate distribu- 
tion function 

F(x,y)=(l--)(l--)(l + ^—), x,y>l; 



yj\ x + y, 

cf. de Haan and Resnick (1977), Example 3. Its density can be written as a 
mixture of two densities, (1 — r)fi(x,y) + r/2(x,y), where 

,( x 1 ./ v 2 / x 2 + 3xy + y 2 ' 

= iv ' /2(x ' y) = Jx+w v + — ^v~~ 

Note that f\ is the density of two independent Pareto(l) random variables. 
Obviously for r = we have (tail) independence. The law P of (1 — F\ (X) , 1 — 
F 2 (Y)) = (1/X, l/Y) is determined by 

P([0,tt] x [0,v]) = w(l + r- - , 0<u,u<l, 
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Bivariate Cauchy on 2 ; p 1 Bivariate Cauchy on 2 ; p 2 



true 

empirical 
MELE 



true 
**• empirical 
— MELE 



Bivariate Cauchy on ; p 3 



Bivariate Cauchy on ; p Inf 





Fig. 2. Trajectories of the empirical spectral measure (dashed) and the MELE (solid) for 
a single sample of size n — 1000 and at k = 30 from the bivariate Cauchy distribution on 
(0, oo) 2 and for p £ {1, 2, 3, oo}. 



hence 



A([0,x] x [0,y])=r 



x + y ' 



< x, y < oo. 



For p£ [1, oo], the corresponding spectral measure $ p satisfies $ p ({0}) = 
$ p ({vr/2}) = l-r and 

* p («0 = l-r + 2r np^% d ^ + (l-r)l(9 = ,/2) 
py ' Jo (cosi9 + smi9) 3 v J y 1 ' 

for G [0,7r/2]. It can be seen that T>T(t) = TO it) as t [ 0, uniformly in 
T > 0. As a consequence, conditions (3.11) and (4.10) in Theorems 3.3 and 
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4.2 hold for a = 1/2 provided k = o(n 1 / 2 ) as n — > oo. If r = 1, the spectral 
measure & p has no atoms. Then T>i/ t (t) = 0{t) as t [ 0, so that condition 
(3.6) in Theorems 3.1 and 4.1 holds provided /c = o(n 2//3 ) as n — > oo. 

5.2. Simulations. In order to compare the two spectral measure estima- 
tors, a simulation study was conducted involving the following distributions: 

• Figure 3: the extreme-value distribution G* in (2.22) with logistic depen- 
dence structure r = 2 and tpi = ip2 = 1 (Example 2.2); 

• Figure 4: the bivariate Cauchy distribution on (0,oo) 2 (Example 5.1); 

• Figure 5: the mixture distribution with r = 0.5 (Example 5.2). 

For each distribution, 1000 samples of size n = 1000 were drawn. For each 
sample, the empirical spectral measure and the MELE were computed for 
various ranges of k and at p £ {l,2,3,oo}. For each estimate, the Inte- 
grated Squared Errors ($ p — & P ) 2 and J^{^ p — & P ) 2 were computed, where 
(a, b) = (0, vr/2) for the logistic model and the Cauchy distribution on (0, oo) 2 
whereas (a, b) = (0.057r/2, 0.957r/2) for the mixture model. Next, the ISEs 
were averaged out over the 1000 samples, yielding the estimates of the Mean 
Integrated Squared Errors which are displayed in the figures. 

From the plots, it can be seen that in all these cases the minimum MISE of 
the MELE is smaller than the one of the empirical spectral measure. Overall 
the MELE outperforms the empirical spectral measure, but in two cases, 
away from the minimum MISE, the empirical spectral measure performs 
slightly better than the MELE (Figure 5). Moreover, most of the time, the 
MISE is decreasing in p. Finally, the presence of atoms at the endpoints has 
an adverse effect on the estimators. 

5.3. Case studies. 

5.3.1. Loss-ALAE data. In Frees and Valdez (1998), copula fitting was 
illustrated on a dataset of 1500 insurance company indemnity claims, dis- 
played in Figure 6. Each claim consists of an indemnity payment (Loss) and 
an allocated loss adjustment expense (ALAE), that is, an expense specifi- 
cally attributable to the settlement of the individual claim such as lawyer's 
fees and investigation expenses. 

The problem is which bivariate tail dependence structure to fit to these 
data. The first question is whether the data are tail independent or not. If 
yes, then the spectral measure should be the sum of the Dirac measures at 
and 7r/2. In Figure 6, the MELE (p = 1, k = 40) shows substantial increase 
away from the endpoints, in clear contrast to the case of independence, for 
which the spectral measure is flat on (0, vr/2). Note that a formal test for tail 
independence is not possible, since under the null hypothesis of tail inde- 
pendence the limit process a p in Theorem 3.1 (and hence all limit processes 
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Fig. 3. MISE of the empirical spectral measure (dashed) and the MELE (solid) for 1000 
samples of size n = 1000 from the logistic model with r = 2 and p £ {1, 2, 3, oo}. 



in subsequent theorems) is degenerate at zero; in contrast, tests with tail 
independence in the alternative hypothesis can be constructed in the more 
specialized framework of Ledford and Tawn (1996). By way of illustration, 
we compared the MELE for the Loss-ALAE data with the MELE for a 
random sample of the same size from the bivariate lognormal distribution 
fitted to the data (right-hand panel of Figure 6) . The MELE for this sample 
is compared to the MELE of the Loss-ALAE data in Figure 7 (left-hand 
panel). Despite the fact that the correlation (on log scale) of both samples 
is equal to 0.44, the MELE of the lognormal sample is much closer to the 
spectral measure of independence (which is the true one for this sample). 

The next question is then which model to fit to the data. In 
Genest, Ghoudi and Rivest (1998), the logistic model (Example 2.2) is pro- 
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Fig. 4. MISE of the empirical spectral measure (dashed) and the MELE (solid) for 1000 
samples of size n — 1000 from the bivariate Cauchy distribution on the positive quadrant 
and p 6 {1, 2, 3, oo}. 



posed, while in Beirlant et al. (2004), Chapter 9, both the logistic (r = 0.73) 
and the asymmetric logistic model (r = 0.66, ip\ = 1, tp2 = 0.89) are fitted. In 
the left-hand panel of Figure 7, the spectral measures (p = 1) of both para- 
metric models are displayed and compared to the MELE. The asymmetric 
logistic model follows the MELE much more closely. Interestingly, the spec- 
tral measure of the fitted asymmetric logistic model does not have an atom 
at 7r/2 while it has an atom at of size 1 — ipi = 0.11. Formal goodness- 
of-fit tests based on the stable tail dependence function I are described in 
de Haan, Neves and Peng (2008) and Einmahl, Krajina and Segers (2008). 



20 J. H. J. EINMAHL AND J. SEGERS 
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Fig. 5. MISE of the empirical spectral measure (dashed) and the MELE (solid) for 1000 
samples of size n = 1000 from the mixture distribution with r — 0.5 and p £ {1, 2, 3, oo}. 



5.3.2. NHANES data. We consider the variables Standing Height (cm) 
and Weight (kg) for females from the National Health and Nutrition Exami- 
nation Survey (NHANES) 2005-2006 [National Center for Health Statistics 
(2007)]. Data of females of age 18-64 years only have been retained; this 
leads to a sample size of 2237. In Figure 8, the MELE of the spectral mea- 
sure (p = 1) has been computed for k G {25, 50} and compared to the spec- 
tral measure for tail independence. Although the data have a nonnegligible 
positive correlation (0.283), the MELE strongly suggests tail independence, 
since it hardly shows any increase away from the endpoints and n/2. The 
conclusion is that extremely tall women are not extremely heavy and con- 
versely. This nicely illustrates the difference between (in)dependence and 
tail (in)dependence. Similar results were obtained for the data for males. 
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bivariate log normal data frho = 0.44) 




° o g 




Fig. 6. Left: Scatterplot of the Loss-ALAE data (axes on log-scale). Right: Scatterplot of 
a bivariate lognormal sample of the same size and which on a log-scale has the same mean 
vector and covariance matrix as the Loss-ALAE data. 



6. Proofs of Theorems 3.1 3.3. 

Proof of Theorem 3.1. A. We first prove weak convergence of the 
process \fk(& p — $ p ) in D[0,tt/A]. More precisely, with A £ {l,|,i,...}, 
we will show that for probabilistically equivalent versions of the processes 
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Fig. 7. Left: MELE (p = l, k — 40) of the spectral measure of the Loss-ALAE data and 
MELE of the bivariate lognormal data of Figure 6. Right: MELE of the spectral measure 
(p — 1, k = 40) of the Loss-ALAE data compared to the spectral measures of the fitted 
logistic and the fitted asymmetric logistic models. 
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Fig. 8. Scatterplot (left) and spectral measure estimates for p = 1 (right) of the variables 
"standing height" and "weight" in the NHANES 2005-2006 body measurement data for 
females. 



involved and any e > 0, 

limlimsupPrj sup \Vk{$ p (6) - <£ p (6>)} 

AJ,0 n -*co L 6»e[0,7r/4] 

(6-1) 

- {W A (C Pt9 ) + Z p (6)}\ >3e} = 0, 

where $ p = & p ,a and Wa = Wa,a- In part B below, we will prove weak 
convergence in D[0,tt/2\. 

Fix A 6 {1,2,3,...} an< ^ M > 1; later on, M will be taken large. Let 
A' = {AD A' :A,A' eA}, where A = A(A, M) is a Vapnik-Cervonenkis (VC) 
class of sets defined as follows. For m = 0, 1, 2, . . . , ^ — 1 define 

f [mAx p {6),{m + l)Ax p (0)}, if 9 G (0,7r/4], 
I A (m,e) = < [0,oo), if (9 = and m = 0, 

[ 0, if (9 = and m > 0; 

JaM = + (2^ - l)(m + l)A),y p (l + (2^ - l)mA)]. 

Set .4 to be the class containing all the following sets: 
l/A-l 

|J {(rr, y):x£ I&(m),Q< y < xtanO + B m (xt&n6) 1/m }, 

for some #G [0,tt/4] and B , Si, . . . , -Bi/A-l G [-1,1], 

l/A-l 

y J&{m), x > x p (9), < y < J/ P (a:)(l + if m )}, 

m=0 
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and 



for some 6 G [0, 7r/4] and Ko,K±, . . . , if i/a-i G 
{(x,y) :x < a}, {(x,y):y<a} 

{(x, y) : x < a or y < a}, for some a £ [0, M]. 



1 1 

2' 2 



Next define A = : ,4 G .A}, where, for A G A A s = {(x,y) : (y,x) G A}. 
Finally define A = A U A s . 

From lim m D 1/t (t) =0, r 1 P([0,oo] x [0,t]) = A([0,oo] x [0,1]) = 1 (0 < 
t < 1), and the homogeneity of A we obtain 



(6.2) 



lim sup \t~ l P(tA) - K{A)\ = 
AeA' 



for all A G and M > 1. Theorem 3.1 in Einmahl (1997) now 

yields our basic convergence result: for a special construction (but keeping 
the same notation) we have 



(6.3) sup 

AeA 



0. 



n — ► oo. 



Throughout, we will work within this special construction. 
In the sequel we can and will redefine C Pi $, 6 € [0,7r/4], by 

(x, y) : < x < oo, y > 0, y < ^Q 2n ( (tan 6)T ln f-x 



k 



n 



n 



k 



k ( n 



V<-rQM -Vp[ z r i 



n \k 



k 



n 



where Qj n is the quantile function corresponding to Tj n , j = 1,2, with 
Qjn(y) '■= for < y < (2n) _1 by convention. Define the marginal tail em- 
pirical processes by 



w 



■ m (x) — ^jy;^"^" 1 



X — X 



and the marginal tail quantile processes by 



in (x) = Vk\ TQjn 



k 



n 



x — x 



x>0, j = 1,2, 



x>0, j = 1,2. 



Note that Wj n and vj n converge almost surely to Wj and — Wj, respectively, 
for j = 1, 2, uniformly on [0, M\. Observe that for x > 0, 



(6.4) 



n 



k 



k 



-r<22n (tan^)r ln —x =xtan9 + 



n 



z n ,e(x) 
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where 

(6.5) z n fi{x) = w\ n (x) tan 6 + V2 n xt&nO -\ -= — tan# 



Also 



n ( k ( n ( k 
-Q2n[ -y P [ 7 r i„ -x 



k \n \k \n 

( w ln {x) \ 1 ( ( , wi n {x) 

= y P [x^ jy- + -7fV 2n [y P [x + 



(6.6) 

( u 

~Vk j 1 vk^ n v p v 1 Vk 

We will treat the terms V njP {ff), Y n>p (0) and r niP {6) from (3.4) in paragraphs 
A.1-A.3, respectively. 

A.l. First we deal with V n , p {6) in (3.4). Set 

Cpfi ) i = {(x,y)eCp ) e:x<x p (0)} and C p , dt2 = C p ,e\C p ,e,i- 

We focus on both sets separately when considering V np {9). For p = oo, C p fi,i 
has been dealt with in Einmahl, de Haan and Piterbarg (2001). We will omit 
the small modifications that are needed for general p£ [l,°o]. However for 
C Pi 0,2; the case p = oo is trivial compared to p £ [1, oo). Therefore we will 
consider 

W):=V*{fF„g<? p ,„, 2 )-=p(^ pA2 )} 

in detail now. 

Recall z n fi(x) in (6.5) and define s n (x) through 

y p [x + j^- +— V2n [y p [x + ^ = y p (x)[l + 



\fk J \fk V V \fk J J V \fk 

Further, put 



x£j&(m) 
x>x p (8) 



y P ( 



x>Xp(6) 

Set, for either choice of sign, 



(x,y) :x G J A (m),x> x p (9),0< y < y p (x) ^1 + 



and 

l/A-l 
^,0 = U R m,A,i 



m=0 
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We have 



(6.7) +^p(^(Nie\N^ e ) 



similarly 



(6.8) -Vk^P^(Nl \N A:9 ) 

=--v- p>2 {e)-r n , p , 2 {e). 

We first deal with r n ^ p ^(0) and next with V^ p2 {0). Using (3.6) and well- 
known results on tail empirical and tail quantile processes [Csorgd and Horvath 
(1993) and Einmahl (1997)] we can show that, as n — ► oo, 

(6.9) sup \r ntPt2 (9)-VkA(N+ e \N Ae )\^0. 
0e [0,71-/4] 



Now consider 



sup VkA(N+ e \N Ai 

9£{0,n/4[ 



Set Cm = y p (l + (2 x / p - l)mA) V x p (0) and note that 

VkA(N+ 9 \N Afj )<Vk £ / / r- K*,y)dydx. 

m=0 Jc m+1 Jy P (x)(l+W m A e /Vk) 

Setting y = y p (x)(l + z/^/k), we can rewrite the right-hand side of the pre- 
vious display as 



Y] f Vp( x ) f _ m Ae ^(x,y p (x)(l + -A=\) dzdx 
m=0 Jc m+1 JW \ VkJJ 
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^j^ 1 f w+ tA>g f (<? m -i)Vp/(i+w- tA>g /Vk) f {(1 + z/VfyvF V"Vp 



m=0 J W- A , e J(<? m+1 -lV/r/(l+Wl Ag /Vk)\l + {(l+z/Vk)v}P 

x X(v, 1) dv dz. 

The integrand is bounded by \{v, 1), whence 

^A(iV+ , \ iV A ) < max (W+ A ,, - W~ Afi ) 

mG{0,I,...,l/A— 1} 

(6.10) 

H^-i)VP/ ( i +w - A yVfc) 
x > / _ A(v, l)cfo. 

r^O • / (^ + i- 1 ) 1/p /( 1 +^,A, e /v / fc) 

We have 
(6.11) 

1 ( ( . Wi n (x) 

H rr" 2 " * x + 



Now from the behavior of tail empirical and tail quantile processes it readily 
follows that sup xg [ 2 i/p |s re (x)| = O p (l). Hence the right-hand side of (6.10) 
can be bounded, with probability tending to one, by 

roo 

3 n W A r^) X(v,l)dv. 

me{0,l,...,l/A-l} ' ' JO 

As A has uniform marginals, necessarily f£° \(v, 1) dv < 1. So in summary 
we have for fixed A and with probability tending to one, 

sup VkA(N+ e \N^ e ) 

6>e [0,7r/4] 

(6.12) 

<3 sup max %x \W+ Afi -W mA$ ). 

From the behavior of s n it follows that for any 5 > 0, 

limlimsupPri sup max n (^ A ,e ~ W~ A e ) > S \ = 0, 

AJ.0 n-»oo Lfl6[0 ) 7r/4]wie{0,l,...,l/A-l} ' ' ' ' J 

and hence, by (6.9), 

(6.13) limlimsupPrj sup r n p 2(6*) > — j = 0. 

A^0 n— 00 L 6»e[0,7r/4] ' ' 2J 

Next consider V^„ 2 (^)j f° r either choice of sign. Since 
lim Pt{n£ £ -4, for all 6 £ [0, vr/4]} = 1, 
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we have, using (6.3), 

(6.14) sup \V± p>2 (e)-W A (Nl e )\^0, n^oo. 

0e[O,7r/4] 

But with similar calculations as for (6.12) we obtain 

with C P) e 5 2 — {(x, y) G C v fi : x > x p (8)}. Since 

sup max jW^ Ae \ = P (1), n ^ oo, 

6»e[0,7r/4] me{0,l,...,l/A-l} 

we have for any A 6 {1, ^, |, . . .}, 

sup A(N^ e A C p>e>2 ) A 0, re — ► oo. 

ee[0,7r/4] 

Hence, since Wa is uniformly continuous on .A with respect to the pseudo- 
metric A(A A A') for A, A' e A, 

(6.15) sup \W A (N± e )-W A (C pA2 )\^0, n-oo. 

0e[O,7r/4] 

Combining (6.7), (6.8), (6.13), (6.14) and (6.15), we now have proven 
limlimsupPrj sup \V n ^ 2 {0) - W A (C p62 )\ > e} = 0. 

A|0 rwoo l 06[O,7r/4] ' 

This, in conjunction with the aforementioned result for C Pt e,i, yields 

(6.16) limlimsupPrj sup \V np (0) - W A {C p g )\ > 2e\ = 0. 

A^O n^oo L ee[0,7r/4] ' J 

A. 2. Next we consider Y n>p {0) = \fk{A(C Pt e) - A(C Pt e)}. We will show that 

(6.17) sup \Y n , p {6) -Z p {6)\ AO, rw oo. 

9e[0,7r/4] 

Again, we will only consider C p fi^- The other part, C p ,e,i, can be handled 
as in Einmahl, de Haan and Piterbarg (2001); only minor modifications are 
needed. 

So we will need to show that, as n — > oo, 
sup Vk{A(C pfi ,2) ~ A{C pfi ,2)} 

0S[O,7r/4] 

roc 

- / X(x,y p (x)){W 1 (x)y' p (x)-W 2 (y p (x))}dx ^ 

Jxp(6) 



0. 



28 J. H. J. EINMAHL AND J. SEGERS 

Observe, with z n ^ and s n as in (6.5) and (6.11), respectively, that 
Vk{A(C p>6;2 ) - A(C p>e>2 )} 

r-oo ry p (x){l+s n (x)/Vk} 



(6.19) 



where 



Vk 



s n (x) = s n (x) A 



xp(8) ->y P ( x ) 

z n ,e{x) xta,n6 

y P (x) V y P {x) 



X(x,y)dydx, 



1 . 



Since for fixed x > x p {6) the expression Vk( x y t& ^ ~ 1) tends to infinity, it 
follows that we can (and will) replace s n (x) by s n (x) in the integral on the 
right-hand side of (6.19). Write 



1 



Now 



sup 

6>S [0,7r/4] 



Vk 



s(x) = ^{W 1 (x)y' p (x) - W 2 (y p (x))}. 

y p (x)(l+s„(x) I 'Vk) 



x P (0) Jy v {x) 

oo 



< sup 

6»G[0,tt/4] 



Vk 
+ Vk 



x p {9) 

00 rV P {x){l+s n (x)/Vk) 
x P {6) Jy p (x)(l+s(x)/Vk) 
00 rvp{ x ){^+ s i x ) /Vk) 
x p {$) Jy P (x) 



x p (6) 



< sup 

6»e[0,7r/4] 



+ sup 

ee[o,7r/4] 



--:T 1 +T 2 . 



\(x, y) dy dx 
y p (x)X(x,y p (x))s(x)dx 

X(x,y) dydx 

X(x, y) dy dx 
y p (x)X(x,y p (x))s(x)dx 

oo rs n (x) / / z \ \ 

y ) Vp ^ x Vp ^ \ + Vk ) ) dzdx 

A(W)(l + -£) 



oo fs(x) 



x p (6) JO 



y P (x) 



X(x,y p (x)) 



dzdx 



Since A(u,l)=u 1 A(l,l/w) and by continuity of A on [0, oo) 2 \ {(0, 0)}, 
we have lim^oo X(v, 1) =0 and thus sup„ >0 A(u, 1) < oo. For some (large) 
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M>2 



T\ < sup 

6»e[0,7r/4] 



x p (8) 
oo ps n (x) 



MVXp($) rs n (x) l /( X P_1)1/P 

(x) 1 + z/Vk \ 1 + z/Vk ' 



1 ) dz dx 



+ 

We first show 
(6.20) 

Define 



poo ps n (x) / z \ 

/ / X[(x p - 1) / p , 1 + — ] dzdx 

JM Js(x) V VkJ 



■-■Txj. + T x ,2. 



sup \s n (x) — s(x)| — ► 0, 

2 1 /p<a;<M 



n — s- oo. 



2/p0) 



u>i„(x) + 



1 



Wl n (x) 



2/ p (x) y p (x) 

Then it follows from the mean-value theorem and the almost sure conver- 
gence of w\ n to W±, uniformly on [0, M], that 

sup |s n (x) — s n (x)| — > 0, re — > oo. 

2 1 /p<x<M 

It also follows easily that 

sup |s n (x) — s(x)\ — > 0, re— > oo, 

2 1 /p<x<M 

whence (6.20). We have with probability tending to one, 

Ti s i < 2Msup X(v, 1) sup \s n (x) — s(x)\, 

v>0 2 1 /p<x<M 

which, because of (6.20), tends to in probability (for any M > 2). Let k > 
and set 5 = i/k/2. Using again (6.20) and the behavior of W\ near infinity, 
we see that for large enough M and with probability tending to one, 

-Wa(l)+a / z \ 

A(Jx p -l) 1 / p ,l + -^= J dzdx 



Ti, 2 < 



M J-W 2 (l)-<5 
W 2 ( 



p-W 2 (l)+S poo i /(' T P_1 > \1/P \ 

/ / ^A(^ t^Adxdz 

J-W 2 (i)-& Jm 1 + z/Vk \ 1 + z/Vk J 



< 



< 



W 2 (l)- 
W 2 {l)+S poo 



z/Vk \ 1 + z/Vk 

{(i + z/Vk)v}p y-v* 

VK 2 (l)-5 A/2(A/p-1)Vp V 1 + {(1 + z/v 7 ^)^}^ 
W 2 (l)+S 



X(v, 1) cfo cfe 



W 2 (l)-<5 



6dz = 25' = K/2, 



whence 
(6.21) 



lim limsupPr{Ti > k} = 0. 

M— >oo n— *oo 
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Now consider T 2 . Write ||s|| = sup 2 i/ P<:r<00 \s(x)\ and 



D n = sup 

x>2!/p 

-|MI<*<NI 

For M>2 1 / p , 

foe fs(x) 

T 2 = sup 

6»e[0,7r/4] 



- 1)Vp, 1 + J=J - A((^ - l) 1 ^, 1) 



/°° f ^ (a((:c p -1) 1/p , 1 + ^=1 -A((x p -l) 1 / p ,l)ldzdx 
Jx p {e)Jo I V vfc/ J 



/•a/ HMI 



+ Im ' f 4 { x i <yXP ~ 1)1/PA + 7k) +x ^ xP - l ) 1/p ^)} dzdx - 



Clearly, as n — > 00 , 

rM 



rivi r\\s\\ 

/ / D n dzdx < 2M\\s\\D n ^ 

J2 1 /p J-\\s\\ 



and also, with probability tending to one, 

r°° r\\ s \ 



Im 1 1| || { A " 1 + 7k) + A((XP ~ 1)VP ' 1} } ^ dX 

A =- ,1 cm 



A/ i-||s|| l + z/^/k \ l + z/y/k 
/ X((x p -l) 1 / p ,l)dzdx 

M J-\\s\\ 

r\\s\\ poo r\\ s \\ r°° 

< / / X(v,l)dvdz + / / X(u, 1) dudz 

-\\s\\ Ji/2(Mp-1) 1 /p J-\M JiMP-iy/p 



<4||s|| / X{v,l)dv. 
Ji/2(MP-iy/p 

As a result, 

(6.22) lim limsupPr{T 2 > k} = 0. 

M^oo n— >oo 

Combining (6.21) and (6.22) yields (6.18), which, in conjunction with the 
aforementioned result for A(C , P) e i i), yields (6.17). 

A.3. We now consider r n ^p (0) in (3.4). From (6.4), (6.6), (3.6) and the 
behavior of tail empirical and tail quantile processes, it follows that 

(6.23) sup \r n>p (6)\ ^ asn^co. 
ee[o,7r/4] 



TAIL DEPENDENCE ESTIMATION 



31 



Combining (6.16), (6.17) and (6.23) yields (6.1). So actually we proved the 
theorem for 9 E [0, 7r/4]. 

B. Observe that, using a symmetry argument, it rather easily follows from 
(6.1) with 6 = vr/4 that 

limlimsupPr[|v / fc{4> p (7r/2) - $ p (vr/2)} 

AJ.0 n^oo 

- {W A (C p<n/2 ) + Z p (tt/2)}\ > 6e] = 0. 

Observe in particular that the first term of Z CjP (7r/4) cancels out with the 
similar term coming from the mirror image (with respect to the line y = x) 
of Cp j7r/ / 4 . By a similar symmetry argument, observing that for 9 E (7r/4, vr/2) 
(the closure of) C P|7r / 2 \ C Pj e is the mirror image of C Pi7r / 2 -6>i it follows that 



lim lim sup Pr 

AJ.0 n ->oo 



(6.24) 



sup \Vk{%{9)-%{9)} 

9e(7r/4,7r/2] 



-{W A (C p> g)+Z p (9)}\>9e 
Combining (6.1) and (6.24) completes the proof. □ 



Proof of Theorem 3.2. The proof of this theorem follows in the same 
way as that of Theorem 3.1; only small adaptations are needed, including 
the obvious adaptation of the VC class A. The main difference between 
both results is the weaker condition (3.8) which allows A to put mass on 
{oo} x [0, oo) or [0, oo) x {oo}; on the other hand 9 is bounded away from 
and 7r/2 in the present result. In the limit process, the term W A {C Pt o) 
stays the same as in Theorem 3.1 but with weaker conditions on A; the 
term Z p (9) = Z CtP (9) + Za(9) may now be different from that in Theorem 
3.1, since there Zj = 0, which might not be the case here. Therefore, we 
confine ourselves to explaining how condition (3.8) is set to use and to the 
adaptation of that part of the proof that deals with Z^. 

Condition (3.8) implies that for some sequence T n 



(6.25) 



VkD Tn (k/n) + Vk/T n + l/Tl 



-.1/2 



0, 



oo. 



We focus on the bias term supgg^/^ \r n ^ p (9)\; see (3.4). For 9 E [ry,7r/4], 
write C p g = C\ U C 2 U C3 , where 



Ci = c>n([o,T n ] x [0,00]), 

C 2 = C p , e n \ [T n ,oo] x 
C 3 = C p , e n ([T n ,oo] x 



n 

°-- k Q 



n 



k 



Q 



2d 









n) _ 










(" 


j,oc 
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By the triangle inequality the bias term can be split up into three terms, 
based on C\, C 2 and C3, respectively. The first one of these terms converges 
to zero in probability, because the first term in (6.25) tends to 0. Using 



11 



-P{ [0,oo] x 



0,Q 2 „( - 

n 



n 



k 



k 



n 



T Q2n[ - = A [°'°°] X 



0,7Q 2 J - 



k 



n 



the second one can be handled similarly. For the third term we replace the 
difference in the definition of r n ^{9) by a sum and deal with both terms 
obtained from this sum separately. Using the behavior of tail empirical and 
tail quantile processes we obtain the convergence of both these terms from 
the convergence to of the second and third term in (6.25). 

Recall that Z d (6) = — $ p ({0})W2(l). We have to show the following ana- 
logue of (6.18): 



sup 

6>e[r/,7r/4] 



(6.26) 



X(x,y p (x)){W 1 (x)y' p (x) - W 2 (y P (x))} dx 

+ <M{0})TU 2 (1) 



x p (9) 



0. 



In view of the proof of (6.18), the proof of (6.26) is complete if we show 
that, as n — > 00, 



SUp |$ p ({0})u 2 „(l) + $p({0})W 2 (l)| = $ p ({0})|u 2 n(l) + W 2 (l) 
0efa,7r/4] 



0. 



But this immediately follows from (6.3). □ 



Proof of Theorem 3.3. The proof of Theorem 3.3 goes along the 
same lines of those of Theorems 3.1-3.2. Observe that we only have to con- 
sider the process \/fc($ p — <3? p ) on [77^ , 7r/ 2 — rj n ] and at tt/2, since on [0, rj n ) 
and (tt/2 — r) n ,7v/2) the process is constant and the limit process is con- 
tinuous on [0, 7r/2). Then we are in a similar situation as in Theorem 3.2, 
but now the interval under consideration depends on n and converges to 
(O.tt/2). 

The essential difference lies in the VC class A.. If we would adapt the 
VC class in the proof of Theorem 3.1 in the obvious way, that is, restrict 9 
to [?7 n ,7r/2 — r] n ], the VC class would depend on n and hence Theorem 3.1 
in Einmahl (1997) would not be applicable. We will, however, consider the 
VC class that is obtained from A of our Theorem 3.1 by omitting 6 = 0. Of 
course, (6.2) does not necessarily hold for this new class, but it can be shown 
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to hold when we replace jP{^-) by P( n ), the measure that is obtained from 
t-P( ) by projecting the probability mass of 

(6.27) -([T n , n/k] x ([0, 1 - AT 1 / 4 ] u [1 + k~ l '\ 3])) 
re 

on the axis {oo} x [0, oo), and by projecting the probability mass of 

(6.28) -(([Cl-^-^Ufl + fc- 1 / 4 ^]) x [T n ,n/k]) 
re 

on the axis [0, oo) x {oo}; here T n > 1jr\ n is a sequence of Ts for which (3.11) 
holds. The points j:(Un,Ui2), i = 1, . . . ,n, in the region (6.27) or (6.28) are 
projected on {oo} x [0, oo) or [0,oo) x {oo} in a similar way, that is, are 
replaced by (oo, f^) or (^Un,oo), respectively. It is easily seen that, with 
probability tending to one, this projection does not change the processes 
involved in the result. □ 

7. Proofs of Theorems 4.1 4.2. 

Proof of Theorem 4.1. Equation (4.9) is an immediate consequence 
of (4.8), so we focus on (4.8). 

Similarly it is immediate from Theorem 3.1 that, in D[0,n/2] and as 
re — > oo, 

(7.1) ^-Q,)! *'™*-*'™*' =»■ 

Recall the definition of / in (4.2) and observe that sup < e<7r / 2 \f{@)\ = 1. 
Put Ai n = f(@i n ) for i G I n . By (7.1) and since Q p {{ir/A}) < 1, necessarily 
Pr[3i G I n : Ai n ^ 0] — ► 1 as n — > oo. 
Consider the random function 

••m=££i& - 1<,,<L 

The derivative of is 

1 4 2 

Hence, on the event {3i G I n : ^Un / 0}, the function ^> n is strictly decreasing 
and there can be at most one fi n G (—1, 1) with \& n (/i n ) = 0. 

If 5: [0,7r/2] — ► K is absolutely continuous with Radon-Nikodym deriva- 
tive g' , then by Fubini's theorem, 



±- E 5 (6 in ) = / g(e)Q p (d9) 
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= g(n/2)- [ I*'* g'{$)d$Q p {de) 
•/ [0,71-/2] Jo 

= g(n/2)- f' 2 Q p ($)g'(<d)d$. 
Jo 

Since similarly JgdQ p = g(ir/2) - fi /2 Q P W(0)d#, by (7.1), 
^f^E^n)- / 9 dQ p ) 

\N n ^ J [0,^/2] J 

(7.2) = - r' 2 Vk{Q p {9) - Q p (6)}g'{6) d6 

Jo 

P p (9)g'(9)d9, rwoo. 



Here we used the fact that the linear functional sending x G D[0,7r/2] to 

Jo /2 x(e)g'(e)de is bounded. 

Since 1/(1 + a;) = 1 — x + x 2 / (1 + x) for x / —1, we have 

1 / n 2 A 2 

*^) = ^E^(i-M m+T ^x 

1 \ . 1 v -2 2 ^ \ ^ -^m 

iV ™ ie/„ iVn ie/„ iVn ie/ B 1 + ^ A%n 

Define 

An = ^ £ E = / /dQ P / / / 2 dQ p . 

Since J fdQ p = and J f 2 dQ p > 0, by (7.2), /2 n = O p (fc" 1 / 2 ) as ra->oo. We 
have 

*n(0) = ^E^« 

iVn ie/„ 

as well as 

1 I A 3 

tf„(2jz n ) = — Tjr- £ ^» + 4/£ — £ 



'tin Jtifi 

Because /2 n = o p (l), |j4j„| < 1 and A'"" 1 Z)ie/„ A \i ^ I f 2 dQ P > 0, we obtain 
lim Pr[|2/2 n | < 1, * n (0)^ n (2/2 n ) < 0] = 1. 
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Since moreover, with probability tending to one, is continuous and de- 
creasing, 

lim Pr[there exists a unique fi n G ( — 1, 1) such that ^f n (fl n ) = 0] = 1. 
Also, Pr(|/%| < 2|/2 n |) — > 1 and thus ft n = O p (k~ 1 ^ 2 ) as n— > oo. We have 

whence 



Lln = Hn + H n ^ J2 = fl n + O p {k ), TL ► OO. 



'V 

Define 



Since N^EiAin = O^k" 1 / 2 ) and iV" 1 E^i = I f dQ p + O^" 1 / 2 ), we 
have j2 n = fi n + O p (k~ 1 ) and thus 

Atn = An + O p (k~ l ), n->oo. 

Put 

Pin • — at (^ fj"nAin)i i G -^ru 

Q p (0):= ]>> m i(e m <0), 0G[O,tt/2]. 

Then 



_ 1 (An ~ t^njAjn + fl n jl n A 2 n 

-^n ^ :c r 1 "I" P"nAi n 

'tin 

Since both /2 n — \x n and /i n /2 n are O p (/c _1 ), 

(7.3) sup \Q P (0) — Q p (0) \ = O p (k~ 1 ), rwoo. 

6»e[0,7r/2] 

Therefore, as n— > oo and uniformly in 9 G [0,7r/2], 

Jk{Q P {Q) - Q P (9)} 

(7.4) = Vk{Q p {e)-Q p {e)} + o p {k~ l i 2 ) 

= Vk{Q p (6) - Q p (9)} - y/kpj[0) + GVAT 1 / 2 ), 
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where 

^) = ArE ^1(6*, <0), 0€ [0,71-/2]. 

n iei n 

The function / is absolutely continuous; denote its Radon-Nikodym deriva- 
tive by /'. By Fubini's theorem, for 6 £ [0, 7r/2], 

1(6) = [ fdQ p = f(6)Q p (6) - [ QpWf'W d-d, 
J[o,e] Jo 

1(6) = [ fdQ p = f(6)Q p (6) - f Q p ($)f'(0) d-d. 
J[o,e] Jo 

As a result, 

(7.5) sup \I(6) - 1(6)\ = O p (k- 1 ' 2 ), rwoo. 

6»e[0,vr/2] 

Moreover, by (7.2) with f = g, 

(7.6) v^ n = - r * r /2 Vk{Q p (6)-Q p (6)}f'(6)d6. 

JJ 2 dQ p Jo 

Write j3 niP = Vk(Q p - Q p ). Combine (7.4), (7.5) and (7.6) to see that 

(7.7) Vk(Q p - Q p ) = p n , p + jj^f p I + O p (k^) 
as n — > oo. Since the linear operator 

f xdf 

D[0, tt/2] -> D[0, tt/2] : x ^ x + J J I 

J f dQ p 

is bounded, (7.1) and (7.7) imply (4.8). □ 

PROOF of Theorem 4.2. It is immediate from Theorem 3.3 that, in 
D[0, tt/2] and as n — > oo, 

Vk(Q p - Q p ) o t ti 4- j3 p . 

Now the proof of Theorem 4.1 applies here as well, except for one change: 
we have to check that (7.2) still holds. But this follows from the fact that 

V2 Vk(Q p - Q p )(6)g'(6) d6 - Vk(Q p - Q p )(r n (6))g'(6) d6 

Jo 

is bounded by 

2Vkr) n sup \g'(6)\, 

8e[0,r ln ]U[TT/2-r, n ,TT/2] 

which by assumption tends to zero as n — > oo provided that g' is bounded 
in the neighborhood of and tt/2. This is the case for g = f and g = f 2 , the 
only functions to which (7.2) is to be applied. □ 
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